East Riding of Yorkshire
Mitigating Individual Skin Tone Bias in Skin Lesion Classification through Distribution-Aware Reweighting
Paxton, Kuniko, Dehghani, Zeinab, Aslansefat, Koorosh, Thakker, Dhavalkumar, Papadopoulos, Yiannis
Skin color has historically been a focal point of discrimination, yet fairness research in machine learning for medical imaging often relies on coarse subgroup categories, overlooking individual-level variations. Such group-based approaches risk obscuring biases faced by outliers within subgroups. This study introduces a distribution-based framework for evaluating and mitigating individual fairness in skin lesion classification. We treat skin tone as a continuous attribute rather than a categorical label, and employ kernel density estimation (KDE) to model its distribution. We further compare twelve statistical distance metrics to quantify disparities between skin tone distributions and propose a distance-based reweighting (DRW) loss function to correct underrepresentation in minority tones. Experiments across CNN and Transformer models demonstrate: (i) the limitations of categorical reweighting in capturing individual-level disparities, and (ii) the superior performance of distribution-based reweighting, particularly with Fidelity Similarity (FS), Wasserstein Distance (WD), Hellinger Metric (HM), and Harmonic Mean Similarity (HS). These findings establish a robust methodology for advancing fairness at individual level in dermatological AI systems, and highlight broader implications for sensitive continuous attributes in medical image analysis.
- Europe > United Kingdom > England > East Yorkshire > Hull (0.40)
- Europe > United Kingdom > England > East Riding of Yorkshire (0.04)
- North America > United States > New York > New York County > New York City (0.04)
- (2 more...)
- Health & Medicine > Therapeutic Area > Dermatology (1.00)
- Health & Medicine > Diagnostic Medicine > Imaging (1.00)
Evaluating Fairness and Mitigating Bias in Machine Learning: A Novel Technique using Tensor Data and Bayesian Regression
Paxton, Kuniko, Aslansefat, Koorosh, Thakker, Dhavalkumar, Papadopoulos, Yiannis
Fairness is a critical component of Trustworthy AI. In this paper, we focus on Machine Learning (ML) and the performance of model predictions when dealing with skin color. Unlike other sensitive attributes, the nature of skin color differs significantly. In computer vision, skin color is represented as tensor data rather than categorical values or single numerical points. However, much of the research on fairness across sensitive groups has focused on categorical features such as gender and race. This paper introduces a new technique for evaluating fairness in ML for image classification tasks, specifically without the use of annotation. To address the limitations of prior work, we handle tensor data, like skin color, without classifying it rigidly. Instead, we convert it into probability distributions and apply statistical distance measures. This novel approach allows us to capture fine-grained nuances in fairness both within and across what would traditionally be considered distinct groups. Additionally, we propose an innovative training method to mitigate the latent biases present in conventional skin tone categorization. This method leverages color distance estimates calculated through Bayesian regression with polynomial functions, ensuring a more nuanced and equitable treatment of skin color in ML models.
- Research Report > Promising Solution (0.48)
- Research Report > New Finding (0.46)
Predictors of Childhood Vaccination Uptake in England: An Explainable Machine Learning Analysis of Longitudinal Regional Data (2021-2024)
Noroozi, Amin, Esha, Sidratul Muntaha, Ghari, Mansoureh
Childhood vaccination is a cornerstone of public health, yet disparities in vaccination coverage persist across England. These disparities are shaped by complex interactions among various factors, including geographic, demographic, socioeconomic, and cultural (GDSC) factors. Previous studies mostly rely on cross-sectional data and traditional statistical approaches that assess individual or limited sets of variables in isolation. Such methods may fall short in capturing the dynamic and multivariate nature of vaccine uptake. In this paper, we conducted a longitudinal machine learning analysis of childhood vaccination coverage across 150 districts in England from 2021 to 2024. Using vaccination data from NHS records, we applied hierarchical clustering to group districts by vaccination coverage into low- and high-coverage clusters. A CatBoost classifier was then trained to predict districts' vaccination clusters using their GDSC data. Finally, the SHapley Additive exPlanations (SHAP) method was used to interpret the predictors' importance. The classifier achieved high accuracies of 92.1, 90.6, and 86.3 in predicting districts' vaccination clusters for the years 2021-2022, 2022-2023, and 2023-2024, respectively. SHAP revealed that geographic, cultural, and demographic variables, particularly rurality, English language proficiency, the percentage of foreign-born residents, and ethnic composition, were the most influential predictors of vaccination coverage, whereas socioeconomic variables, such as deprivation and employment, consistently showed lower importance, especially in 2023-2024. Surprisingly, rural districts were significantly more likely to have higher vaccination rates. Additionally, districts with lower vaccination coverage had higher populations whose first language was not English, who were born outside the UK, or who were from ethnic minority groups.
- Europe > United Kingdom > England > Lincolnshire (0.32)
- Europe > United Kingdom > England > Shropshire (0.15)
- Europe > United Kingdom > England > East Sussex (0.15)
- (47 more...)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Health & Medicine > Therapeutic Area > Vaccines (1.00)
- Health & Medicine > Therapeutic Area > Immunology (1.00)
- Government > Regional Government > Europe Government > United Kingdom Government (0.35)
Logical Consistency of Large Language Models in Fact-checking
Ghosh, Bishwamittra, Hasan, Sarah, Arafat, Naheed Anjum, Khan, Arijit
In recent years, large language models (LLMs) have demonstrated significant success in performing varied natural language tasks such as language translation, question-answering, summarizing, fact-checking, etc. Despite LLMs' impressive ability to generate human-like texts, LLMs are infamous for their inconsistent responses -- a meaning-preserving change in the input query results in an inconsistent response and attributes to vulnerabilities of LLMs such as hallucination, jailbreaking, etc. Consequently, existing research focuses on simple paraphrasing-based consistency assessment of LLMs, and ignores complex queries that necessitates an even better understanding of logical reasoning by an LLM. Our work therefore addresses the logical inconsistency of LLMs under complex logical queries with primitive logical operators, e.g., negation, conjunction, and disjunction. As a test bed, we consider retrieval-augmented LLMs on a fact-checking task involving propositional logic queries from real-world knowledge graphs (KGs). Our contributions are three-fold. Benchmark: We introduce three logical fact-checking datasets over KGs for community development towards logically consistent LLMs. Assessment: We propose consistency measures of LLMs on propositional logic queries as input and demonstrate that existing LLMs lack logical consistency, specially on complex queries. Improvement: We employ supervised fine-tuning to improve the logical consistency of LLMs on the complex fact-checking task with KG contexts.
- Europe > United Kingdom > England > Durham (0.14)
- South America > Brazil (0.04)
- Europe > Denmark > North Jutland > Aalborg (0.04)
- (7 more...)
- Research Report > New Finding (0.93)
- Personal > Honors (0.68)
- Leisure & Entertainment (1.00)
- Media > Film (0.46)
How English is YOUR hometown? Scientists reveal the place names that are the most 'archetypically English' - so, is yours on the list?
England is famous for its eccentric place names, from'Matching Tye' to'Fingringhoe' and'Upton Snodsbury'. But a new AI study now reveals the most English-sounding locations in the country – and they certainly conjure up images of cricket and afternoon tea. The study shows that'Harlington', a district of London, is the most archetypal English place name, along with'Widdington' in Essex and'Colworth' in West Sussex. It contrast, 'Anna', a settlement in Hampshire, is the least English-sounding, along with'Belgravia' in London and'Moira' in Leicestershire. Although AI was used to determine the language basis of English place names, not the meaning, the results could reveal more about the history of the locations.
- Europe > United Kingdom > England > West Sussex (0.28)
- Europe > United Kingdom > England > Leicestershire (0.26)
- Europe > United Kingdom > Scotland (0.06)
- (11 more...)
Britain's most amazing shipwrecks REVEALED: Underwater monuments to the UK's rich maritime heritage
A whopping 350 years after it sank off the coast of Norfolk, authorities have revealed on Friday that HMS Gloucester has finally been found. The'outstanding' ship, which sank on May 6, 1682 after hitting the Norfolk sandbanks in the southern North Sea, was uncovered 28 miles off the coast of Great Yarmouth half-buried on the seabed. But HMS Gloucester is just one of thousands of shipwrecks that litter the British coast, the majority of which haven't been seen by the human eye for centuries. It's thought nearly 40,000 wrecks could be waiting to be found off the British coast, according to Historic England, providing snapshots of the UK's rich maritime heritage. But at least 90 are known to exist and experts have pinpointed their location, although many likely won't ever be brought to land and could disintegrate to nothing in the decades to come.
- Europe > North Sea (0.56)
- Atlantic Ocean > North Atlantic Ocean > North Sea > Southern North Sea (0.25)
- Europe > United Kingdom > Scotland > Orkney (0.05)
- (15 more...)
- Government (0.96)
- Transportation > Marine (0.69)